A proposal for improving WordNet Domains
نویسندگان
چکیده
WordNet Domains (WND) is a lexical resource where synsets have been semi-automatically annotated with one or more domain labels from a set of 165 hierarchically organized domains. The uses of WND include the power to reduce the polysemy degree of the words, grouping those senses that belong to the same domain. But the semi-automatic method used to develop this resource was far from being perfect. By cross-checking the content of the Multilingual Central Repository (MCR) it is possible to find some errors and inconsistencies. Many are very subtle. Others, however, leave no doubt. Moreover, it is very difficult to quantify the number of errors in the original version of WND. This paper presents a novel semi-automatic method to propagate domain information through the MCR. We also compare both labellings (the original and the new one) allowing us to detect anomalies in the original WND labels.
منابع مشابه
Enriching the Integration of Semantic Resources based on WordNet
In this paper we present the enrichment of the Integration of Semantic Resources based in WordNet (ISR-WN Enriched). This new proposal improves the previous one where several semantic resources such as SUMO, WordNet Domains and WordNet Affects were related, adding other semantic resources such as Semantic Classes and SentiWordNet. Firstly, the paper describes the architecture of this proposal e...
متن کاملWordNet and Automated Text Summarization
Proposals for text classification and information retrieval have been recently presented making use of the WordNet ontology. Generally, this methodology requires statistical induction of synset clusters and entails costly training of specific key domains. The present proposal intends to show that a simple recursive evaluation procedure and WordNet are rich enough to obtain useful results in tex...
متن کاملA graph-Based Approach to WSD Using Relevant Semantic Trees and N-Cliques Model
In this paper we propose a new graph-based approach to solve semantic ambiguity using a semantic net based on WordNet. Our proposal uses an adaptation of the Clique Partitioning Technique to extract sets of strongly related senses. For that, an initial graph is obtained from senses of WordNet combined with the information of several semantic categories from different resources: WordNet Domains,...
متن کاملEnhancement Electronic evaluation for Semantic Arabic Oral Exam
From the importance of knowledge in the speech, we knew the importance of oral exam. So in this paper we integrated BOW (Bag of Word), LSA(Latin Semantic Analysis), ASR (automatic speech recognition), zero crossing rate, and Ontology based approach to automate the online oral exam especially in Arabic language with take into consideration the authentication problem. Our proposal method faced ma...
متن کاملImproving Semantic Knowledge Base for Transfer Learning in Sentiment Analysis
Sentiment analysis deals with the computational treatment of opinion, sentiment, and subjectivity in text, has attracted a great deal of attention. Sentiment analysis has been widely used across a wide range of domains in recent years, such as information retrieval, question answering systems and social network. This paper presents a new method for improving the semantic knowledge base for sent...
متن کامل